SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT:
(A) NAME: Boehringer Ingelheim International GmbH
(B) STREET: Binger Strasse 173
(C) CITY: Ingelheim am Rhein
(E) COUNTRY: Germany
(F) POSTAL CODE (ZIP): 55216
(G) TELEPHONE: 06132/772282
(H) TELEFAX: 06132/774377

(ii) TITLE OF INVENTION: Tumor-associated Antigen

(iii) NUMBER OF SEQUENCES: 28

(iv) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 679 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Homo sapiens
(F) TISSUE TYPE: Melanoma

(ix) FEATURE:
(A) NAME/KEY: 3'UTR



SUBSTITUTE SHEET (RULE 26) 



WO 00/23584 



2/18 



PCT/EP99/07832 



(B) LOCATION: 340..679

(ix) FEATURE:
(A) NAME/KEY: 5'UTR
(B) LOCATION: 1..9

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 10..339

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CGACGGGCG ATG CTG ATG GCC CAG GAG GCC CTG GCA TTC CTG ATG GCC 48
Met Leu Met Ala Gln Glu Ala Leu Ala Phe Leu Met Ala
1 5 10

CAG GGG GCA ATG CTG GCG GCC CAG GAG AGG CGG GTG CCA CGG GCG GCA 96
Gln Gly Ala Met Leu Ala Ala Gln Glu Arg Arg Val Pro Arg Ala Ala
15 20 25

GAG GTC CCC GGG GCG CAG GGG CAG CAA GGG CCT CGG GGC CGA GAG GAG 144
Glu Val Pro Gly Ala Gln Gly Gln Gln Gly Pro Arg Gly Arg Glu Glu
30 35 40 45

GCG CCC CGC GGG GTC CGC ATG GCG GTG CCG CTT CTG CGC AGG ATG GAA 192
Ala Pro Arg Gly Val Arg Met Ala Val Pro Leu Leu Arg Arg Met Glu
50 55 60

GGT GCC CCT GCG GGG CCA GGA GGC CGG ACA GCC GCC TGC TTC AGT TGC 240
Gly Ala Pro Ala Gly Pro Gly Gly Arg Thr Ala Ala Cys Phe Ser Cys
65 70 75

ACA TCA CGA TGC CTT TCT CGT CGC CCA TGG AAG CGG AGC TGG TCC GCA 288
Thr Ser Arg Cys Leu Ser Arg Arg Pro Trp Lys Arg Ser Trp Ser Ala
80 85 90

GGA TCC TGT CCC GGG ATG CCG CAC CTC TCC CCC GAC CAG GGG CGG TTC 336
Gly Ser Cys Pro Gly Met Pro His Leu Ser Pro Asp Gln Gly Arg Phe
95 100 105

TGA AGGACTTCAC CGTGTCCGGC AACCTACTGT TTATCCGACT GACTGCTGCA 389
110

GACCACCGCC AACTGCAGCT CTCCATCAGC TCCTGTCTCC AGCAGCTTTC CCTGTTGATG 449

TGGATCACGC AGTGCTTTCT GCCCGTGTTT TTGGCTCAGG CTCCCTCAGG GCAGAGGCGC 509

TAAGCCCAGC CTGGCGCCCC TTCCTAGGTC ATGCCTCCTC CCCTAGGGAA TGGTCCCAGC 569

ACGAGTGGCC AGTTCATTGT GGGGGCCTGA TTGTTTGTCG CTGGAGGAGG ACGGCTTACA 629

TGTTTGTTTC TGTAGAAAAT AAAGCTGAGC TACGAAAAAA AAAAAAAAAA 679






(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 109 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Leu Met Ala Gln Glu Ala Leu Ala Phe Leu Met Ala Gln Gly Ala
1 5 10 15

Met Leu Ala Ala Gln Glu Arg Arg Val Pro Arg Ala Ala Glu Val Pro
20 25 30

Gly Ala Gln Gly Gln Gln Gly Pro Arg Gly Arg Glu Glu Ala Pro Arg
35 40 45

Gly Val Arg Met Ala Val Pro Leu Leu Arg Arg Met Glu Gly Ala Pro
50 55 60

Ala Gly Pro Gly Gly Arg Thr Ala Ala Cys Phe Ser Cys Thr Ser Arg
65 70 75 80

Cys Leu Ser Arg Arg Pro Trp Lys Arg Ser Trp Ser Ala Gly Ser Cys
85 90 95

Pro Gly Met Pro His Leu Ser Pro Asp Gln Gly Arg Phe *
100 105 110



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 767 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Homo sapiens
(F) TISSUE TYPE: Melanoma

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 54..596

(ix) FEATURE:
(A) NAME/KEY: 3'UTR
(B) LOCATION: 597..767

(ix) FEATURE:
(A) NAME/KEY: 5'UTR
(B) LOCATION: 1..53

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

ATCCTCGTGG GCCCTGACCT TCTCTCTGAG AGCCGGGCAG AGGCTCCGGA GCC ATG 56
Met
1

CAG GCC GAA GGC CAG GGC ACA GGG GGT TCG ACG GGC GAT GCT GAT GGC 104
Gln Ala Glu Gly Gln Gly Thr Gly Gly Ser Thr Gly Asp Ala Asp Gly
5 10 15

CCA GGA GGC CCT GGC ATT CCT GAT GGC CCA GGG GGC AAT GCT GGC GGC 152
Pro Gly Gly Pro Gly Ile Pro Asp Gly Pro Gly Gly Asn Ala Gly Gly
20 25 30

CCA GGA GAG GCG GGT GCC ACG GGC GGC AGA GGT CCC CGG GGC GCA GGG 200
Pro Gly Glu Ala Gly Ala Thr Gly Gly Arg Gly Pro Arg Gly Ala Gly
35 40 45

GCA GCA AGG GCC TCG GGG CCG AGA GGA GGC GCC CCG CGG GGT CCG CAT 248
Ala Ala Arg Ala Ser Gly Pro Arg Gly Gly Ala Pro Arg Gly Pro His
50 55 60 65

GGC GGT GCC GCT TCT GCG CAG GAT GGA AGG TGC CCC TGC GGG GCC AGG 296
Gly Gly Ala Ala Ser Ala Gln Asp Gly Arg Cys Pro Cys Gly Ala Arg
70 75 80

AGG CCG GAC AGC CGC CTG CTT CAG TTG CAC ATC ACG ATG CCT TTC TCG 344
Arg Pro Asp Ser Arg Leu Leu Gln Leu His Ile Thr Met Pro Phe Ser
85 90 95

TCG CCC ATG GAA GCG GAG CTG GTC CGC AGG ATC CTG TCC CGG GAT GCC 392
Ser Pro Met Glu Ala Glu Leu Val Arg Arg Ile Leu Ser Arg Asp Ala
100 105 110

GCA CCT CTC CCC CGA CCA GGG GCG GTT CTG AAG GAC TTC ACC GTG TCC 440
Ala Pro Leu Pro Arg Pro Gly Ala Val Leu Lys Asp Phe Thr Val Ser
115 120 125

GGC AAC CTA CTG TTT ATC CGA CTG ACT GCT GCA GAC CAC CGC CAA CTG 488
Gly Asn Leu Leu Phe Ile Arg Leu Thr Ala Ala Asp His Arg Gln Leu
130 135 140 145

CAG CTC TCC ATC AGC TCC TGT CTC CAG CAG CTT TCC CTG TTG ATG TGG 536
Gln Leu Ser Ile Ser Ser Cys Leu Gln Gln Leu Ser Leu Leu Met Trp
150 155 160

ATC ACG CAG TGC TTT CTG CCC GTG TTT TTG GCT CAG GCT CCC TCA GGG 584
Ile Thr Gln Cys Phe Leu Pro Val Phe Leu Ala Gln Ala Pro Ser Gly
165 170 175

CAG AGG CGC TAA GCCCAGCCTG GCGCCCCTTC CTAGGTCATG CCTCCTCCCC 636
Gln Arg Arg *
180

TAGGGAATGG TCCCAGCACG AGTGGCCAGT TCATTGTGGG GGCCTGATTG TTTGTCGCTG 696

GAGGAGGACG GCTTACATGT TTGTTTCTGT AGAAAATAAA GCTGAGCTAC GAAAAAAAAA 756

AAAAAAAAAA A 767



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 180 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Gln Ala Glu Gly Gln Gly Thr Gly Gly Ser Thr Gly Asp Ala Asp
1 5 10 15

Gly Pro Gly Gly Pro Gly Ile Pro Asp Gly Pro Gly Gly Asn Ala Gly
20 25 30

Gly Pro Gly Glu Ala Gly Ala Thr Gly Gly Arg Gly Pro Arg Gly Ala
35 40 45

Gly Ala Ala Arg Ala Ser Gly Pro Arg Gly Gly Ala Pro Arg Gly Pro
50 55 60

His Gly Gly Ala Ala Ser Ala Gln Asp Gly Arg Cys Pro Cys Gly Ala
65 70 75 80

Arg Arg Pro Asp Ser Arg Leu Leu Gln Leu His Ile Thr Met Pro Phe
85 90 95

Ser Ser Pro Met Glu Ala Glu Leu Val Arg Arg Ile Leu Ser Arg Asp
100 105 110

Ala Ala Pro Leu Pro Arg Pro Gly Ala Val Leu Lys Asp Phe Thr Val
115 120 125

Ser Gly Asn Leu Leu Phe Ile Arg Leu Thr Ala Ala Asp His Arg Gln
130 135 140

Leu Gln Leu Ser Ile Ser Ser Cys Leu Gln Gln Leu Ser Leu Leu Met
145 150 155 160

Trp Ile Thr Gln Cys Phe Leu Pro Val Phe Leu Ala Gln Ala Pro Ser
165 170 175

Gly Gln Arg Arg *
180



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 993 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Homo sapiens
(F) TISSUE TYPE: Melanoma

(ix) FEATURE:
(A) NAME/KEY: 5'UTR
(B) LOCATION: 1..55

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 56..688

(ix) FEATURE:
(A) NAME/KEY: 3'UTR
(B) LOCATION: 689..993

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:



GCATCCTCGT GGGCCCTGAC CTTCTCTCTG AGAGCCGGGC AGAGGCTCCG GAGCC ATG 58
Met
1

CAG GCC GAA GGC CAG GGC ACA GGG GGT TCG ACG GGC GAT GCT GAT GGC 106
Gln Ala Glu Gly Gln Gly Thr Gly Gly Ser Thr Gly Asp Ala Asp Gly
5 10 15

CCA GGA GGC CCT GGC ATT CCT GAT GGC CCA GGG GGC AAT GCT GGC GGC 154
Pro Gly Gly Pro Gly Ile Pro Asp Gly Pro Gly Gly Asn Ala Gly Gly
20 25 30

CCA GGA GAG GCG GGT GCC ACG GGC GGC AGA GGT CCC CGG GGC GCA GGG 202
Pro Gly Glu Ala Gly Ala Thr Gly Gly Arg Gly Pro Arg Gly Ala Gly
35 40 45

GCA GCA AGG GCC TCG GGG CCG AGA GGA GGC GCC CCG CGG GGT CCG CAT 250
Ala Ala Arg Ala Ser Gly Pro Arg Gly Gly Ala Pro Arg Gly Pro His
50 55 60 65

GGC GGT GCC GCT TCT GCG CAG GAT GGA AGG TGC CCC TGC GGG GCC AGG 298
Gly Gly Ala Ala Ser Ala Gln Asp Gly Arg Cys Pro Cys Gly Ala Arg
70 75 80

AGG CCG GAC AGC CGC CTG CTT CAG TTG CAC ATC ACG ATG CCT TTC TCG 346
Arg Pro Asp Ser Arg Leu Leu Gln Leu His Ile Thr Met Pro Phe Ser
85 90 95

TCG CCC ATG GAA GCG GAG CTG GTC CGC AGG ATC CTG TCC CGG GAT GCC 394
Ser Pro Met Glu Ala Glu Leu Val Arg Arg Ile Leu Ser Arg Asp Ala
100 105 110

GCA CCT CTC CCC CGA CCA GGG GCG GTT CTG AAG GAC TTC ACC GTG TCC 442
Ala Pro Leu Pro Arg Pro Gly Ala Val Leu Lys Asp Phe Thr Val Ser
115 120 125

GGC AAC CTA CTG TTT ATG TCA GTT CGG GAC CAG GAC AGG GAA GGC GCT 490
Gly Asn Leu Leu Phe Met Ser Val Arg Asp Gln Asp Arg Glu Gly Ala
130 135 140 145

GGG CGG ATG AGG GTG GTG GGT TGG GGG CTG GGA TCC GCC TCT CCG GAG 538
Gly Arg Met Arg Val Val Gly Trp Gly Leu Gly Ser Ala Ser Pro Glu
150 155 160

GGG CAG AAA GCT AGG GAT CTC AGA ACA CCC AAA CAC AAG GTC TCA GAA 586
Gly Gln Lys Ala Arg Asp Leu Arg Thr Pro Lys His Lys Val Ser Glu
165 170 175

CAG AGA CCT GGT ACA CCA GGC CCG CCG CCA CCC GAG GGA GCC CAG GGA 634
Gln Arg Pro Gly Thr Pro Gly Pro Pro Pro Pro Glu Gly Ala Gln Gly
180 185 190

GAT GGG TGC AGA GGT GTC GCC TTT AAT GTG ATG TTC TCT GCC CCT CAC 682
Asp Gly Cys Arg Gly Val Ala Phe Asn Val Met Phe Ser Ala Pro His
195 200 205

ATT TAG CCGACTGACT GCTGCAGACC ACCGCCAACT GCAGCTCTCC ATCAGCTCCT 738
Ile *
210

GTCTCCAGCA GCTTTCCCTG TTGATGTGGA TCACGCAGTG CTTTCTGCCC GTGTTTTTGG 798

CTCAGGCTCC CTCAGGGCAG AGGCGCTAAG CCCAGCCTGG CGCCCCTTCC TAGGTCATGC 858

CTCCTCCCCT AGGGAATGGT CCCAGCACGA GTGGCCAGTT CATTGTGGGG GCCTGATTGT 918

TTGTCGCTGG AGGAGGACGG CTTACATGTT TGTTTCTGTA GAAAATAAAG CTGAGCTACG 978

AAAAAAAAAA AAAAA 993



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 210 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Gln Ala Glu Gly Gln Gly Thr Gly Gly Ser Thr Gly Asp Ala Asp
1 5 10 15

Gly Pro Gly Gly Pro Gly Ile Pro Asp Gly Pro Gly Gly Asn Ala Gly
20 25 30

Gly Pro Gly Glu Ala Gly Ala Thr Gly Gly Arg Gly Pro Arg Gly Ala
35 40 45

Gly Ala Ala Arg Ala Ser Gly Pro Arg Gly Gly Ala Pro Arg Gly Pro
50 55 60

His Gly Gly Ala Ala Ser Ala Gln Asp Gly Arg Cys Pro Cys Gly Ala
65 70 75 80

Arg Arg Pro Asp Ser Arg Leu Leu Gln Leu His Ile Thr Met Pro Phe
85 90 95

Ser Ser Pro Met Glu Ala Glu Leu Val Arg Arg Ile Leu Ser Arg Asp
100 105 110

Ala Ala Pro Leu Pro Arg Pro Gly Ala Val Leu Lys Asp Phe Thr Val
115 120 125

Ser Gly Asn Leu Leu Phe Met Ser Val Arg Asp Gln Asp Arg Glu Gly
130 135 140

Ala Gly Arg Met Arg Val Val Gly Trp Gly Leu Gly Ser Ala Ser Pro
145 150 155 160

Glu Gly Gln Lys Ala Arg Asp Leu Arg Thr Pro Lys His Lys Val Ser
165 170 175

Glu Gln Arg Pro Gly Thr Pro Gly Pro Pro Pro Pro Glu Gly Ala Gln
180 185 190

Gly Asp Gly Cys Arg Gly Val Ala Phe Asn Val Met Phe Ser Ala Pro
195 200 205

His Ile *
210



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 752 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Homo sapiens

(ix) FEATURE:
(A) NAME/KEY: 5'UTR
(B) LOCATION: 1..53

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 54..596

(ix) FEATURE:
(A) NAME/KEY: 3'UTR
(B) LOCATION: 597..752

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
ATCCTCGTGG GCCCTGACCT TCTCTCTGAG AGCCGGGCAG AGGCTCCGGA GCC ATG 56
Met
1

CAG GCC GAA GGC CGG GGC ACA GGG GGT TCG ACG GGC GAT GCT GAT GGC 104
Gln Ala Glu Gly Arg Gly Thr Gly Gly Ser Thr Gly Asp Ala Asp Gly
5 10 15

CCA GGA GGC CCT GGC ATT CCT GAT GGC CCA GGG GGC AAT GCT GGC GGC 152
Pro Gly Gly Pro Gly Ile Pro Asp Gly Pro Gly Gly Asn Ala Gly Gly
20 25 30

CCA GGA GAG GCG GGT GCC ACG GGC GGC AGA GGT CCC CGG GGC GCA GGG 200
Pro Gly Glu Ala Gly Ala Thr Gly Gly Arg Gly Pro Arg Gly Ala Gly
35 40 45

GCA GCA AGG GCC TCG GGG CCG GGA GGA GGC GCC CCG CGG GGT CCG CAT 248
Ala Ala Arg Ala Ser Gly Pro Gly Gly Gly Ala Pro Arg Gly Pro His
50 55 60 65

GGC GGC GCG GCT TCA GGG CTG AAT GGA TGC TGC AGA TGC GGG GCC AGG 296
Gly Gly Ala Ala Ser Gly Leu Asn Gly Cys Cys Arg Cys Gly Ala Arg
70 75 80

GGG CCG GAG AGC CGC CTG CTT GAG TTC TAC CTC GCC ATG CCT TTC GCG 344
Gly Pro Glu Ser Arg Leu Leu Glu Phe Tyr Leu Ala Met Pro Phe Ala
85 90 95

ACA CCC ATG GAA GCA GAG CTG GCC CGC AGG AGC CTG GCC CAG GAT GCC 392
Thr Pro Met Glu Ala Glu Leu Ala Arg Arg Ser Leu Ala Gln Asp Ala
100 105 110

CCA CCG CTT CCC GTG CCA GGG GTG CTT CTG AAG GAG TTC ACT GTG TCC 440
Pro Pro Leu Pro Val Pro Gly Val Leu Leu Lys Glu Phe Thr Val Ser
115 120 125

GGC AAC ATA CTG ACT ATC CGA CTG ACT GCT GCA GAC CAC CGC CAA CTG 488
Gly Asn Ile Leu Thr Ile Arg Leu Thr Ala Ala Asp His Arg Gln Leu
130 135 140 145

CAG CTC TCC ATC AGC TCC TGT CTC CAG CAG CTT TCC CTG TTG ATG TGG 536
Gln Leu Ser Ile Ser Ser Cys Leu Gln Gln Leu Ser Leu Leu Met Trp
150 155 160

ATC ACG CAG TGC TTT CTG CCC GTG TTT TTG GCT CAG CCT CCC TCA GGG 584
Ile Thr Gln Cys Phe Leu Pro Val Phe Leu Ala Gln Pro Pro Ser Gly
165 170 175

CAG AGG CGC TAA GCCCAGCCTG GCGCCCCTTC CTAGGTCATG CCTCCTCCCC 636
Gln Arg Arg *
180

TAGGGAATGG TCCCAGCACG AGTGGCCAGT TCATTGTGGG GGCCTGATTG TTTGTCGCTG 696

GAGGAGGACG GCTTACATGT TTGTTTCTGT AGAAAATAAA ACTGAGCTAC GAAAAA 752

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 180 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Gln Ala Glu Gly Arg Gly Thr Gly Gly Ser Thr Gly Asp Ala Asp
1 5 10 15

Gly Pro Gly Gly Pro Gly Ile Pro Asp Gly Pro Gly Gly Asn Ala Gly
20 25 30

Gly Pro Gly Glu Ala Gly Ala Thr Gly Gly Arg Gly Pro Arg Gly Ala
35 40 45

Gly Ala Ala Arg Ala Ser Gly Pro Gly Gly Gly Ala Pro Arg Gly Pro
50 55 60

His Gly Gly Ala Ala Ser Gly Leu Asn Gly Cys Cys Arg Cys Gly Ala
65 70 75 80

Arg Gly Pro Glu Ser Arg Leu Leu Glu Phe Tyr Leu Ala Met Pro Phe
85 90 95

Ala Thr Pro Met Glu Ala Glu Leu Ala Arg Arg Ser Leu Ala Gln Asp
100 105 110

Ala Pro Pro Leu Pro Val Pro Gly Val Leu Leu Lys Glu Phe Thr Val
115 120 125

Ser Gly Asn Ile Leu Thr Ile Arg Leu Thr Ala Ala Asp His Arg Gln
130 135 140

Leu Gln Leu Ser Ile Ser Ser Cys Leu Gln Gln Leu Ser Leu Leu Met
145 150 155 160

Trp Ile Thr Gln Cys Phe Leu Pro Val Phe Leu Ala Gln Pro Pro Ser
165 170 175

Gly Gln Arg Arg
180



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 752 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Homo sapiens

(ix) FEATURE:
(A) NAME/KEY: 5'UTR
(B) LOCATION: 1..93

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 94..270

(ix) FEATURE:
(A) NAME/KEY: 3'UTR
(B) LOCATION: 271..752

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
ATCCTCGTGG GCCCTGACCT TCTCTCTGAG AGCCGGGCAG AGGCTCCGGA GCCATGCAGG 60

CCGAAGGCCG GGGCACAGGG GGTTCGACGG GCG ATG CTG ATG GCC CAG GAG GCC 114
Met Leu Met Ala Gln Glu Ala
1 5

CTG GCA TTC CTG ATG GCC CAG GGG GCA ATG CTG GCG GCC CAG GAG AGG 162
Leu Ala Phe Leu Met Ala Gln Gly Ala Met Leu Ala Ala Gln Glu Arg
10 15 20

CGG GTG CCA CGG GCG GCA GAG GTC CCC GGG GCG CAG GGG CAG CAA GGG 210
Arg Val Pro Arg Ala Ala Glu Val Pro Gly Ala Gln Gly Gln Gln Gly
25 30 35

CCT CGG GGC CGG GAG GAG GCG CCC CGC GGG GTC CGC ATG GCG GCG CGG 258
Pro Arg Gly Arg Glu Glu Ala Pro Arg Gly Val Arg Met Ala Ala Arg
40 45 50 55

CTT CAG GGC TGA ATGGATGCTG CAGATGCGGG GCCAGGGGGC CGGAGAGCCG 310
Leu Gln Gly *

CCTGCTTGAG TTCTACCTCG CCATGCCTTT CGCGACACCC ATGGAAGCAG AGCTGGCCCG 370

CAGGAGCCTG GCCCAGGATG CCCCACCGCT TCCCGTGCCA GGGGTGCTTC TGAAGGAGTT 430

CACTGTGTCC GGCAACATAC TGACTATCCG ACTGACTGCT GCAGACCACC GCCAACTGCA 490

GCTCTCCATC AGCTCCTGTC TCCAGCAGCT TTCCCTGTTG ATGTGGATCA CGCAGTGCTT 550

TCTGCCCGTG TTTTTGGCTC AGCCTCCCTC AGGGCAGAGG CGCTAAGCCC AGCCTGGCGC 610

CCCTTCCTAG GTCATGCCTC CTCCCCTAGG GAATGGTCCC AGCACGAGTG GCCAGTTCAT 670

TGTGGGGGCC TGATTGTTTG TCGCTGGAGG AGGACGGCTT ACATGTTTGT TTCTGTAGAA 730

AATAAAACTG AGCTACGAAA AA 752



(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 58 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Met Leu Met Ala Gln Glu Ala Leu Ala Phe Leu Met Ala Gln Gly Ala
1 5 10 15

Met Leu Ala Ala Gln Glu Arg Arg Val Pro Arg Ala Ala Glu Val Pro
20 25 30

Gly Ala Gln Gly Gln Gln Gly Pro Arg Gly Arg Glu Glu Ala Pro Arg
35 40 45

Gly Val Arg Met Ala Ala Arg Leu Gln Gly *
50 55



(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 11 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Met Leu Met Ala Gln Glu Ala Leu Ala Phe Leu
1 5 10



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Leu Met Ala Gln Glu Ala Leu Ala Phe Leu
1 5 10



(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: synthetic DNA
QSnjOm TAGAAQGTAC G 21 



(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: synthetic DNA
20 TGATGTGCAA CIGA&3CAQG 20 



(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: synthetic DNA

GCACTGCGTG ATCCACATCA A 21



(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: synthetic DNA
5 0 OGACIOOA TAGGCMAGA G 

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: synthetic DNA

GCACATCACG ATGCCTTTCT CGTCG 25



(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: synthetic DNA

CACACAAAGC TTGGCTTAGC GCCTCTGCCC TG 32



(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: synthetic DNA

CACACAGGAT CCATGGATGC TGCAGATGCG 30



(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: synthetic DNA

GASGAACAIA TQCTGATGQC CCAQGAGGC 29 

(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: synthetic DNA

TTAAAGATCT CAGAACCGCC CCTGGTCG 28



(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: synthetic DNA

TTACTCGAGA TGCTGATGGC CCAGG 25

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: synthetic DNA

AAGGTACCTT GAACCGCCCC TGGTCG 26

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Phe Leu Met Ala Gln Gly Ala Met Leu
1 5 9



(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Ala Met Leu Ala Ala Gln Glu Arg Arg Val
1 5 10



(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Met Leu Ala Ala Gln Glu Arg Arg Val
1 5 9



(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Tyr Tyr Met Asn Gly Thr Met Ser Gln Val
1 5 10



(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

Glu Val Asp Pro Ile Gly His Leu Tyr
1 5 9